CDS
Accession Number | TCMCG024C56329 |
gbkey | CDS |
Protein Id | XP_022039342.1 |
Location | join(168806633..168806645,168806722..168806912,168807713..168807896,168808009..168808146,168808253..168808423,168808512..168808615,168808692..168808759,168809233..168809399,168809989..168810058,168810203..168810357,168810440..168810542,168810624..168810711,168810788..168810931,168811116..168811343) |
Gene | LOC110941952 |
GeneID | 110941952 |
Organism | Helianthus annuus |
Protein
Length | 607aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA396063 |
db_source | XM_022183650.2 |
Definition | serine protease SPPA, chloroplastic [Helianthus annuus] |
EGGNOG-MAPPER Annotation
COG_category | OU |
Description | protease |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE |
ko00000
[VIEW IN KEGG] ko01000 [VIEW IN KEGG] ko01002 [VIEW IN KEGG] |
KEGG_ko |
ko:K04773
[VIEW IN KEGG] |
EC | - |
KEGG_Pathway | - |
GOs | - |
Sequence
CDS: ATGGGTTCATCAAAGGACAACAATGGCGATCTCAAATCCAAAATGGATGTTAACGGAGGCGAAAATGAGTATCCAAGTGGCGAATTCGAGTACAAAACTCCAACTGCTTGGAAGAGCTTCATGGTAAACCTACGCATGCTAGTTGCTTACCCATGGCTCAGGGTTCGCAAGGGTAGCGTATTGTATATTAAACTGCGCGGCAAGATAACTGATCAATTAAAGAGTCGTTTCTCTTCCGGTTTATCACTACCACAATTATGTGAAAACTTGATTAAGGCAGCGTATGATCCTCATATATCTGGTGTTTATCTCCACATTGAAACCTTGAATTGTGGATGGGGTATAACTGATGAAATCAGAAGGCACATATTGGATTTTAAGAAGTCAGGGAAGTTCATTATCGGTTACGCACCTGTATGGCATGAAAAGGAGTATTACCTCGGATGTGTCTGTGACGAGTTCTACGCCCCTCCAAGTGCCTACTTTTCATTGTATGGTTTTTCTAGAGGAGCGTCGTTTTATGGAGGTGTATTTGAGAAAATAGGTGTGGAACCACAAGTGCATAGGATTGGTAAGTATAAGACTTTTGGCGATATGTTAACTCGCAAGAATATATCCGAAGAAAATCGTGAGGTGCTTACTACAATCCTGGATGATGTCTACGAGAATTGGGTCGATAAGGTTTCTCAAGCCAAAGGAAAGAGTAAGGAAGAAATCAAGAGTTTTATTAATGAAGGAGTTTACCAAATAGATAAGTTGAAGGAAGATGGATGGATAACAGATATCAAATATGATGATGAGGTTAAATCTATGTTGAAAACAAGATTATGCATTGCTGAGAAGAAAAAATTTACACTTATTGAATACAAGAAATACTCGAGAATCAGGAAATGGAGTGTGGGGTTATCAGATGGAAAAGACCGAATTGCGGTAATTAGAGCTTCTGGTAGCATTACTCGTGTAGGAGGGTCGTTTTTTACGCCTAGTTCAGGCATCGTAGCTGAACAATTCATCAAAAAGATTAGCAAAGTAAGAGATTCAAAAAGGTATAAGGCCGTTATCATCCGAATTGATAGCCCTGGGGGTGGTCATGTTGCTTCTGACCTGATGTGGAGGGAAATCAAACTATTGGCAGAATCCAAGCCTGTAATTGCATCAATGGTTGACGTGGCCGCAAGTGGAGGATACTACATGGCAATGGCGGCGAATGCTATAGTCTCCGAGAATCTTACTTTAACGGGCTCAATTGGTGTAGTCTCATTGAATTACAATTCGGAGAAACTATTTGAAAAGATTGGTTTCAACAAAGAAGTTATATCAAAGGGACGATATGCTGAGCTGTTTACCGATAACCGGTCATTCAGACCTGATGAAGAGAAACTGTTTGCGGAGCGTGCCCAGAATATTTACGAACGCTTTCGTGAAAAGGCAGCATGTTCCAGATCAATGAGTGTGGAAGAGATGGAAGAGATAGCTCAAGGGAGAGTATGGAGTGGTAAGGATGCTGCTTCACGAGGTTTAGTTGATGCAATCGGAGGCTTTTCACGGGCTGTTGCTATAGCCAAACACAAGGCCAACATACCTCACAACAAACAGGTCGCACTGGTTGAGCTTTCGAAACCATCACTATCTATACAAAAATTCCTATTTGGCATGTTGAGCTCAGCAATCGGAATAGACAAAACACTAAAGCATCTGCAGGGTGATTTTGCAACGAGCGACGAGGTGCAAGCACGCATGGATGGAGCCATGTTTCATGGGTCAGGAGGATCATCTGCGGTCCCTAATTTCGGTTTTCTAAAAGACTACGTAGCTTCTCTTTGA |
Protein: MGSSKDNNGDLKSKMDVNGGENEYPSGEFEYKTPTAWKSFMVNLRMLVAYPWLRVRKGSVLYIKLRGKITDQLKSRFSSGLSLPQLCENLIKAAYDPHISGVYLHIETLNCGWGITDEIRRHILDFKKSGKFIIGYAPVWHEKEYYLGCVCDEFYAPPSAYFSLYGFSRGASFYGGVFEKIGVEPQVHRIGKYKTFGDMLTRKNISEENREVLTTILDDVYENWVDKVSQAKGKSKEEIKSFINEGVYQIDKLKEDGWITDIKYDDEVKSMLKTRLCIAEKKKFTLIEYKKYSRIRKWSVGLSDGKDRIAVIRASGSITRVGGSFFTPSSGIVAEQFIKKISKVRDSKRYKAVIIRIDSPGGGHVASDLMWREIKLLAESKPVIASMVDVAASGGYYMAMAANAIVSENLTLTGSIGVVSLNYNSEKLFEKIGFNKEVISKGRYAELFTDNRSFRPDEEKLFAERAQNIYERFREKAACSRSMSVEEMEEIAQGRVWSGKDAASRGLVDAIGGFSRAVAIAKHKANIPHNKQVALVELSKPSLSIQKFLFGMLSSAIGIDKTLKHLQGDFATSDEVQARMDGAMFHGSGGSSAVPNFGFLKDYVASL |